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Abstract. We consider the image of some classes of bipartite quantum states under a tensor 
product of random quantum channels. Depending on natural assumptions that we make on the 
states, the eigenvalues of their outputs have new properties which we describe. Our motivation 
is provided by the additivity questions in quantum information theory, and we build on the idea 
that a Bell state sent through a product of conjugated random channels has at least one large 
eigenvalue. We generalize this setting in two directions. First, we investigate general entangled pure 
inputs and show that that Bell states give the least entropy among those inputs in the asymptotic 
limit. We then study mixed input states, and obtain new multi-scale random matrix models that 
allow to quantify the difference of the outputs' eigenvalues between a quantum channel and its 
complementary version in the case of a non-pure input. 



1. Introduction 

1.1. Background. One of the most important questions in quantum communication theory is 
whether a quantum channel has additive properties or not [1, 4, 9, 10, 11, 12, 13, 14, 18]. If a 
channel <3? is additive for the Holevo capacity x(') m the sense that 3N £ N, Vn ^ N 

(1) = n X (d>) 

then the classical capacity of the quantum channel equals the Holevo capacity, giving a one-shot 
(non- asymptotic) formula for the classical capacity C(-): 

(2) C(<&) = lim - X (^ n ) = xm 

n— >oo n 



By definition, 



(3) = max 

W,pi) 



where, {pi,Pi} are ensembles and S(-) is the von Neumann entropy. The above additive property 
was conjectured to be true for all quantum channels until Hastings showed [13] existence of channels 
such that 

(4) S min ($ ® $) < S mi n($) + S min ($) 

Here, 5 m i n (-) is the minimal output entropy (MOE). Indeed, this result also gives counterexamples 
to (1) by [18, 12, 9] and as a result C($) ^ x(<&) in general. Note that 

(5) S min ($) = minS($(p)) 

p 

where the minimum is take over all pure inputs (rank-one projections). 

One of the most important results in the additivity theory of quantum channels is Hastings' 
proof of the additivity of the minimal output entropy [13]. The proof contains two disjoint parts: 
establishing a lower bound for the minimum output entropy for one channel, which was the main 
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contribution of Hastings, and an upper bound for the minimal output entropy of the product 
channel. The latter part, a very delicate question, is dealt with in a very crude manner, by simply 
bounding the minimum over the set of all bi-partite input states with the value of a single sample. 
It is thus of utmost importance to choose a state with small output entropy and Hastings uses an 
idea introduced by Hayden and Winter in [14]. This idea, putting a maximally entangled state (or 
a Bell state) through a product of conjugate channels has been, to our knowledge, the only example 
of a bi-partite input state with small output entropy. 

The purpose of this paper is to generalize this idea and to introduce new classes of interesting 
input states. Our results are derived using mainly the graphical calculus introduced in [3] and 
the technique of moments in random matrix theory. Our conclusion is that the Bell state gives 
asymptotically, among some large classes of input states, the output with the least entropy. 

Our results do not imply that the Bell state gives the largest violation of additivity - and proving 
such a result is certainly very difficult as it would require an optimal bi-partite version of the lower 
bounds of [13] - but they stand as a solid mathematical evidence towards the fact that the physically 
intuitive choice of the Bell state is indeed close to being optimal. 

The paper is organized as follows. In Section 2 we recall some basic facts about random quantum 
channels, the Hayden- Winter trick and the graphical notation needed to perform the integrals over 
the unitary group in the rest of the paper. Sections 3 and 4 generalize the idea of Hayden and 
Winter, both in the case of conjugate channels and in the case of identical channels. In Section 5 
we explain why the Hayden- Winter trick only works in the case of conjugate channels. Section 6 
introduces two models of mixed bi-partite inputs that quantify the difference between a quantum 
channel and its conjugate when the input is not pure. 

2. Review on random quantum channels and unitary integration 

2.1. Random quantum channels. A quantum channel $ : M^. n (C) — > M^ ou( (C) in the Stine- 
spring's picture is described as 

(6) <&(p) = Tr cden „ [VpV*] 
where 

(7) V : C din ->■ C denv ® C dout 

is an isometry. Here, di n , d ou t and d env are dimensions of input space, output space and environment 
respectively. Moreover, we induce the measure on the set of quantum channels from the Haar 
measure on the unitary group U{d ou t ■ d env ) in the following way. We endow the set of isometries 
by truncating a unitary matrix distributed along the Haar measure and we consider the image 
measure on the set of quantum channels. 

If we switch the roles of output and environment spaces a quantum channel becomes what is 
called its complementary channel $ c : M din (C) ->• M denv (C) [15, 16]: 

(8) $ C \p) = Tr c<w [VpV*] 

A quantum channel and its complementary channel share the same output eigenvalues for any pure 
input via Schmidt decomposition, see [15, 16] for details. Our interest in complementary channels 
is motivated by the fact that, often, the size of the environment is smaller than the output size, so 
output states are easier to study for <3? c . 

2.2. The Hayden- Winter trick. As stated above, in proving violation of additivity one needs a 
small enough upper bound for the minimum output entropy of product of two quantum channels 
to show that some entangled input gives an output with strictly less entropy than all the product 
inputs do. The idea introduced by Hayden and Winter is that the Bell state gives an output with a 
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large eigenvalue via the product of any channel and its complex conjugate. More precisely, consider 
the maximally entangled (or the Bell) state 

din 



(9) w) = 1 -Y d \m 

air). 



where are the canonical basis vectors. Then, it has been shown in [14, 3] that, for any channel 

$ we have 

(10) (<p m \(Q *)(\<p m )(<Pm\)\<Pm) > , ^ • 

This yields a lower bound for the largest eigenvalue of the output for In turn, this can 

provide a bound that is small enough for the MOE in order to ensure violation of additivity, using 
the inequality 

(11) S min ($ $) < S[($ *)(\<p m )(<p m \)]. 

Note that in the above inequality, $ (the complex conjugate channel of $) is defined by replacing 
U by U in the definition of 

2.3. Unitary integration. Let us start by recalling the definition of a combinatorial object of 
interest, the unitary Weingarten function. 

Definition 2.1. The unitary Weingarten function Wg(n, a) is a function of a dimension parameter 
n and of a permutation a in the symmetric group S p . It is the inverse of the function a i-> n#°" 
under the convolution for the symmetric group denotes the number of cycles of the permutation 
a). 

Notice that the function a i— >■ n^ a is invertible when n is large, as it behaves like n p S e as n — > oo. 
Actually, if n < p the function is not invertible any more, but we can keep this definition by taking 
the pseudo inverse and the theorems below will still hold true (we refer to [8] for historical references 
and further details). We shall use the shorthand notation Wg(<r) = Wg(n,cr) when the dimension 
parameter n is clear from context. 

The function Wg is used to compute integrals with respect to the Haar measure on the unitary 
group (we shall denote by U(n) the unitary group acting on an n-dimensional Hilbert space). The 
first theorem is as follows: 

Theorem 2.2. Let n be a positive integer and i = (i±, . . . ,i p ), i! = . . . ,i' p ), j = . . . ,j p ), 
f = ■ ■ ■ ,j' p ) be p-tuples of positive integers from {1, 2, . . . , n}. Then 



(12) / U lljl ---U ipjp U i , ij ,---U i/pj , dU = 

JU(n) 



E<5,- ,-/ . . . 5j ,-/ ... 6, Wg(n, to- 1 ). 

cr,TeS p 

If p p' then 

(13) f u tin ■ ■ ■ u ipjp l^...w-r du = o. 

JU(n) 1 l 

Since we shall perform integration over large unitary groups, we are interested in the values of 
the Weingarten function in the limit n — > oo. The following result encloses all the data we need for 
our computations about the asymptotics of the Wg function; see [2] for a proof. 
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Theorem 2.3. For a permutation a G S p , let Cycles(a) denote the set of cycles of a. Then 

(14) Wg(n,<7)= n Wg(n,c)(l + 0(n- 2 )) 

c£Cycles(a) 

and 

(15) Wg(n,(l,...,d)) = (-l)^Vi [J ^-j)' 1 

-d+l^j^d-l 

where Ci = r^ji j\ is the i-th Catalan number. 

As a shorthand for the quantities in Theorem 2.3, we introduce the function Mob on the sym- 
metric group. Mob is invariant under conjugation and multiplicative over the cycles; further, it 
satisfies for any permutation a £ S p : 

(16) Wg(n, a) = rc-^ + l CT l)(Mob(<7) + 0(n~ 2 )) 

where \a\ = p — jfa is the length of a, i.e. the minimal number of transpositions that multiply to 
a. We refer to [8] for details about the function Mob but what we have to know in this paper is 
that 

(17) Moh(a) ~ (-l)H 

when a consists of disjoint transpositions. 

We finish this section by a well known lemma which we will use several times towards the end 
of the paper. This result is contained in [17]. 

Lemma 2.4. The function d(a,r) = |c _1 t| is an integer valued distance on S p . Besides, it has 
the following properties: 

• the diameter of S p is p — 1; 

• d(-,-) is left and right translation invariant; 

• for three permutations <7i, 02, r £ S p , the quantity d(r, o\) + d(r, 02) has the same parity as 
d(a 1 ,o- 2 ); 

• the set of geodesic points between the identity permutation id and some permutation a £ S p 
is in bisection with the set of non-crossing partitions smaller than ir, where the partition 
tt encodes the cycle structure of a. Moreover, the preceding bijection preserves the lattice 
structure. 

2.4. Graphical calculus. We recall briefly in this subsection the graphical calculus method for 
computing unitary integrals introduced in [3]. For more details on this method, we refer the reader 
to the paper [3] and to other work which make use of this technique [5, 6, 7] 

In the graphical calculus matrices (or, more generally, tensors) are represented by boxes. Each 
box has differently shaped symbols, where the number of different types of them equals that of 
different spaces (exceptions are mentioned bellow). Those symbols are empty (while) or filled 
(black), corresponding to primal or dual spaces. Wires connect these symbols, corresponding to 
tensor contractions. A diagram is a collection of such boxes and wires and corresponds to an 
element of an abstract element in a tensor product space. 

The main advantage of such a representation is that it provides an efficient way of computing 
expectation values of such tensors when some (or all) of the boxes are random unitary matrices, 
into an efficient way to implement the Weingarten calculus: the delta functions in each summand in 
the RHS of (12) describes how we connect boxes. Each pair of permutations (a, r) in (12) eliminate 
U and U boxes and reconnect the wires originally connected to these boxes to get a new graph. 

This process for a fixed permutation is called a removal and the whole process which sums all 
the new graphs over the all permutations is called the graph expansion. Importantly, the graphical 
calculus works linearly and separated components are related by tensor product, as is assumed 
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implicitly above. In this setting, the Bell state is represented by a wire connecting two black 
symbols. 

We are allowed to make tensor-product of some spaces and decompose it into different spaces 
giving them different symbols from the original set of symbols. In this case, we must bear in mind 
that these two set of symbols represent the same product space when we expand the graph. 



3. Generalized Bell states for U x U. 

In the seminal paper of Hastings [13], violation of additivity was found when the dimensions of 
input and output spaces are much larger than that of the environment space of the channel. We 
also follow this scenario, by setting d ou t = n, d env = k and d; m = tnk. Here, the integer k and 
t € (0, 1) are fixed parameters and we let n go infinity to observe the asymptotic behavior. Since 
the output space (C n ) is larger than the environment space (C fc ), we investigate the complementary 
channels so that we have to study output matrices of smaller, fixed dimensions. As it was discussed 
in Section 2, the value of the minimum output entropy does not change, since the eigenvalues of 
the two partial traces of a rank one projector are the same, up to zeroes. 

We are thus interested in the following k x k random matrix: 

(18) Z n = $ C ®$ C (|Y> n )(V>n|) 

Here, \ip n ) is a fixed input vector for each n and <I> is random in the measure defined above. 

To represent the input \ifS) in the graphical calculus we add A and A* boxes on the wires of the 
Bell input (See Figure 1). 



A A 

s 

Figure 1. Generalized Bell state 
Algebraically, we consider a sequence of inputs 

tnk 

(19) \ip n ) = ^2 a,ij\i)\j) a,ij G C 

i,J=l 

Then 

tnk 

(20) A n =Y j aa\i)(j\ 

is used in the graphical calculus. In order to ensure normalized vectors, we choose A n 6 M tn fc(C) 
to be such that 

(21) Tr[A n A* n ] = 1 

We consider first cases when A is scaled properly as a generalization of the Bell state input, when 
interesting phenomena occur. Some of other cases will be treated later in this section. 
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3.1. Well-behaved input. In order to define well-behaved inputs, we introduce two assumptions 
on the asymptotic behavior of the sequence of input states A n . 
Assumption 1: 



(22) 



Tr [A n ] 
Vtnk 



m + O ( 4t 

n z 



for some m G C. Note that the similar limit for A* n is m. 
Assumption 2: 

(23) HAillc 



in 



With these two assumptions, we can prove the following result. Recall that the empirical eigen- 
value distribution of a matrix Z G M fe 2 (C) is the probability measure 

i=i 

where Ai, . . . , A fc 2 are the eigenvalues of Z. 

Theorem 3.1. Under Assumptions 1 and 2, the empirical eigenvalue distribution of the matrix Z n 



converges almost surely, as n 
(24) 



oo, to the probability measure 
' [5 Xl + (fc 2 - 1)6 X2 ] 



k 2 



where the Dirac masses are located at 



1 — t\m\ 2 



and 



A 2 



1 — t\m\ 



(25) X 1 = t\m\ 2 + k2 ..... ^ k2 

In other words, the output state has asymptotically the following eigenvalues: 
t\m\ 2 + , with multiplicity one; 

|2 



1 p^- , with multiplicity k 2 — 1. 



IP 

Proof. The proof uses the moment method and consists of two steps. First, we compute the 
asymptotic moments of the output density matrix Z and then, by a Borel-Cantelli argument, we 
deduce the almost sure convergence of the spectral distribution and of the eigenvalues. 

Step 1: We calculate the limit moments of Z n , using the graphical calculus, see Figure 2. Here, o 
and • correspond to the n-dimensional environment space of <3? c , and □ and ■ to the fc-dimensional 
output space, whereas o and ♦ correspond to the tn/c-dimensional input space. 



U 



U 



u* 



A — A 



u 



u 



3 



u 



A 



A* 



U 



II 



Figure 2. $ <g) $ with generalized input 

For fixed p G N the Weingarten sums are indexed by pairs of permutations (a, (3) G S 2 P - Here, 
we label the U and the U boxes in the following manner: 1 T , 2 T , . . . ,p T for the U boxes of the first 
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channel (T as "top") and l B , 2 B , . . . , p B for the U boxes of the second channel (B as "bottom"). 
We shall also order the labels as {1 T , 2 T , . . . , p T , 1 B , 2 B , . . . ,p B } ~ {1, . . . , 2p}. A removal r = 
(a, /3) £ <S| P of the random (U and ?7) boxes connects the decorations in the following way: 

(1) the white decorations of the i-th [/-block are paired with the white decorations of the a(i)-th 
U block; 

(2) the black decorations of the i-th [/-block are paired with the black decorations of the /3(i)-th 
U block. 

Next, we introduce two fixed permutations 7, S € S2 P which represent wires appearing in the 
diagram before the graph expansion. The permutation 7 represents the initial wiring of the □ 
decorations and 5 accounts for the wires between the ♦ decorations connecting boxes A or A*. 
More precisely, for all i, 

(26) 7 (i T ) = (i- if, 1 {i B ) = {i + l) B , and 5{i T ) = i B , 5(i B ) = i T . 

A difference between this calculation and the one in [3] is that the Bell state with • in that paper 
turned into ♦ with the A boxes here. So, for these wires we get "necklaces" with A or A* beads 
instead of just loops. Hence, we now can list of calculation elements in the graphical calculus for 
each {a, 13) G S2 P x S^: 

(1) "□"-loops: k*^ 1 ^- 

(2) "o"-loops: n# a ; 

(3) 'V-loops: none; 

(4) "necklaces" :/(/3), which is to be defined below; 

(5) Weingarten weights for the [/-matrices: Wg(a/? _1 ) = Wg(a _1 /3). 
Therefore, the Weingarten graph expansion formula (Theorem 4.1 of [3]) reads 

(27) ETr[Z*]= £ n# a k#^ la ^f{p) Wg(a~ 1 /3) 
Here, 



(28) f(0) = Q Tr [A 5 -- 1 • • • 

ceCyclc^- 1 ^) 

Here, ||c|| is the number of elements in c and s c ± . . . , s c \\ c \\ are defined such that 



(29) 



1 if the ith element in the cycle c belongs to T 
* if the ith element in the cycle c belongs to B 



Note that the above function f((3) is well-defined in spite of the ambiguity of s Cj i, because of the 
circular property of the trace. It is also well-behaved, in fact, since for each c 

(30) I Tr [A** 1 ■ ■ ■ I < \\A S ^ • • • p s =.IMI-i • \\A S ^W ||i 



I ell — 1 



<(-^) -v^ = 



n l-||c||/2 



we get the following bound 

(31) <n#^-v 
Hence, 

(32) ETt[ZP}< n #a n # (^ l5 )- p n- 2p -l^ 1/3 l asn^oo 

a,pes 2p 

The power of n in the RHS of (32) whole is 

(33) 2p - |a| +p - 1/3-^1 -2p- \a~ l p\ = p- {\a\ + |a _1 /3| + l/5~^l) < 
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Here, equality holds if and only if id — > a — > (3 — > 5 is a geodesic: 

(34) a = Y[r t , fi=Y[n 

ieA ieB 

where Tj = (i T ,i B ) and ACBC{l,... ) j)};we refer to [3] for the proof. Importantly, for these (3 
we have the following asymptotic behavior: 

(35) f(J3) = (tnk\m\ 2 ) W +o(Jp) 

Note that \(3\ = \B\. This implies that the power of n in (27) in fact becomes for all the a, (3 
which satisfy the geodesies condition id — > a — > f3 — > 5: 

(36) #a + \B\ -2p- \a~ l p\ = 2p - \A\ + \B\ - 2p - \B \ A\ = 
Here, \B\ = \B \ A\ + \A\. Next, we make use of the following lemma. 
Lemma 3.2. For a geodesic: id — > a — > (3 — > 5 we have 



(37) #(7~M 



\2 A = 
\\A\ A± 

(38) #(5(3) = p+\B\ 

(39) |a _1 /3| = \B\A\ 



We thus have the following approximation on ETr[Zn] (note that the estimate on the error order 
is not necessary here but will be so in Step 2) : 



(40) 



E[ZP] = Yl (n* a k#^f((3) + o(^]) Wg(«-^) 



id— >a->/3->i 



£ ^ #{7 " Q) {tk\m\ 2 ) m k- 2 ^ 1 ^ Mob(a- 1 /3) + O 



id— >a— >/3— >8 

Note that |a _1 /3| = |S\^4|. The first equality holds because Lemma 2.4 implies that permutations 
(a, f3) £ S2 P x S2 P off the geodesic make the power of n less by two or more; only even powers are 
allowed. The second inequality comes from (16) and (35). 
Hence, 

limETr[Z£] = £ ^ (tk\m\ 2 ) |B| fc- 2 P-l B \ A l(-l)l B \ A l 

AC.B 



(a) 

^ fcW(tife|m| 2 ) |B| A;-*-l B \ A l(-l)l B \ A l 



0=ACB 



(6) 

+ Yl k 2 {tk\m\ 2 f l k- 2 P-\ B \ A \(-l)\ B \ A \ 



®=ACB 



(c) 

The combinatorial sums above can be computed using the following multinomial identities. 
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Lemma 3.3. 



Y x |A| = (1 + xf and 
</>CAC{l,...,p} 

Y x^ A y B ^ = (i+x+yr. 

0CACBC{l,...,p} 



We obtain that 
(41) (a) 



k- 2p J2(k-tk- H 2 ) |A| x (tk ■ \m\ 2 ■ AT 1 • (-1)) 



\B\A\ 



ACB 



Also, 
(42) 

and similarly, 

(43) 

Therefore, 
(44) 



= k~ 2p (l + tk 2 \m\ 2 -t\m\ 2 ) p . 

( b ) = k- 2p y i tk ■ h 2 ■ k ~ x ■ (-i)) 151 = k ' 2p (i - *MT 



( C ) = k 2 - 2p Y ( tk ■ H 2 ■ k ~ l ■ (- 1 )) |B| = k2 ~ 2p (i - *HT ■ 



lim ETr[Z£l 



1 {k 2 - l)t\m\ 



k 2 + 



k 2 



+ (fe 2 " 1) 



1 ilml 



fc 2 fc 2 



which completes the proof of the first step. 



Step 2: We now move on to prove the almost sure convergence. Since this part of proof is very 
similar to that of Theorem 6.3 in [3] we only show the sketch of proof here. Via the Borel-Cantelli 
Lemma, all we have to prove is that the covariance series converges: 



oo oo 

(45) J>[(Tr[Z£]-ETr[Z£]) 2 ] = J> [(Tr [Z£]) 2 ] -(ETr[Z£]) 2 < 



DC 



n=l 



n=l 



which implies that for all p ^ 1 

(46) Tr [Z p ] -»• (44) a.e. as n -»• oo 

Indeed, this shows the convergence of empirical distribution. Also, by Carleman's condition, equa- 
tion (44) uniquely determines the measure as in (24). 
First, (40) implies that 

(ETr[Z£]) 2 =( Yl k^^itklm^k-^-^^Mobia-^) 

1 



(47) 



+ o 



We then calculate E[(Tr[Z£]) ]. In the diagram we have two identical copies of Tr Z p , which 
means that we have Ap pairs of U and U boxes. As a result, removals (a, (3) are defined for 
a, j3 £ S^p. However, importantly those two copies are initially separated. Namely, initial wires 
7, 5 G S2 P © Sip = S^p are written as direct sums: 

(48) 7 = 7 © 7 and 5 = 5 © S 
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Then, as before, we calculate the power of n, which is 

(49) 2p-(|a| + |a- 1 ^| + |^- 1 5|) ^0 

Here, "=" holds if and only if id — > a — > j3 — > 5 is a geodesic. Moreover, this geodesic condition 
implies that a and f3 can be written as 

(50) a = ai a 2 and /? = fa fa 

Here, pairs (a.\,fa) and (ct2,fa) are defined as in (34). 

Therefore, in the diagram all removals which matter as n — > 00 keep those two copies separated. 
Also, these removals have the following properties: 

#(7' 1 a) = #( 7 - 1 ai) + #(7~ 1 a 2 ), 

\P\ = \Pi\ + \fa\, \a-^\ = \a^fa\ + W 2 ~ 1 fa\ 

Mo^cr 1 ^) = Mob{a^ l fa) ■ MotyaJ 1 ^) 

As before, we get an approximation with the error of order 1/n 2 : 

(51) E(Tr[Z£]) 2 = \k*^ lai)+ *^ la ^ {tk\m\ 2 ) m+m 

id^ai^/3i— >8 
id->a2->'/32— >S 



k -4p-\ ai ^i|+|a a V2I Mob(a^ 1 /3i) • Mob^fa) 

Here, the error is of order 1/n 2 for the same reason as before. 
Finally, we see from (47) and(51) that 



+ o[ — 



(52) E (Tr[Z£]) 2 -(ETr[^]) 2 = 



which proves (45), and finalizes the proof. □ 

3.2. Consequence of Theorem 3.1. We start by considering some special cases of interest where 
the previous theorem applies. 

Example 1: the Bell state. The original, non-perturbed Bell state corresponds to the following 
matrix A: 

(53) A = -jL= ■ diag{l, . . . , 1} 

\Jtnk 

Then, 

(54) m = lim tnk ■ — - = 1 

n->oo tnk 

Hence, we have the following limit eigenvalue distribution: 

• t + ^ttti with multiplicity one; 

• with multiplicity k 2 — 1. 

In particular, when t = 1/k (the dimension of input space is n), we get 

• I + p- — p-, with multiplicity one; 

• p — p, with multiplicity k 2 — 1, 

recovering in this way the results of [3]. 

Note that this method yields better bounds for the largest eigenvalue (and thus for the entropy) 
than a direct application of the Hayden- Winter trick, which only gives a crude bound (Ai ^ t) on 
the largest eigenvalue of the output. 
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Dephased Bell state. Take the input to be the maximally entangled state with phases 

^ )= v^£ exp {S4 u)ii) 

-1. The corresponding matrix A is 

*-^W£M£}--'} 

0. Hence, we have the following flat limit eigenvalue distribution: k 2 eigenvalues equal 

Example 3: Normal matrices with asymptotic moments. We consider now a generalization of the 
previous two situations, where the matrices A n are normal and have asymptotic moments in the 
sense that the following limit exists: 

(57) v P ,oo, m „ = ^ ^0§§ ; = jzn<Mz), 

where [/, is a compactly supported measure on the complex plane. Since we want input states to 
be normalized, we assume m\ t \ = 1. Obviously, such matrices satisfy Assumption 1 of Theorem 
3.1, with m = m^o- Although Assumption 2 may not be satisfied, note that one can still get the 
bound in equation (31) for the *-moments of A n using normality and the hypothesis above; since 
this is the only place in the proof of Theorem 3.1 when one uses Assumption 2, the result holds. 

The next theorem, one of the main results of the paper, is an easy consequence of Theorem 3.1 
and presents the usual Bell state as the unique input state of a conjugate product channel that 
yields an output with minimal entropy, within the class of well-behaved input states. 

Theorem 3.4. Among all generalized Bell states described by a sequence of matrices A n satisfying 
the assumptions of Theorem 3.1, the one that achieves an output with minimal entropy is the usual 
Bell state, obtained by setting A n = Itnkl 'Vtnk. 

Proof. At fixed t, the largest eigenvalue of the limiting measure is an increasing function of \m\. 
Hence, the output with te least entropy is obtained for the larges achievable value of \m\. The 
normalization constraint TrL4 n A*] = 1 implies that one must have \m\ ^ 1, with equality if and 
only if A n is equal, up to a phase, to the identity matrix I tn ^/-\/tnk. □ 

Remark 3.5. As it was noted in the introduction, the previous result does not imply that the Bell 
state achieves the minimum in the formula for the minimal output entropy of a product of conjugate 
channels. Our result just states that the Bell state achieves the least entropy when compared to other 
input states which exhibit a nice eigenvalue behaviour, in the sense of the two assumptions appearing 
before Theorem 3.1. The global minimum for the output entropy could in principle be achieved on 
a state not having such a behaviour. 

3.3. Ill-behaved input. We examine in this subsection input states which do not satisfy Assump- 
tion 2. 

Case 1: When the fixed input is a product state, i.e., the corresponding matrix A is of rank one, 
we get the eigenvalue 1/k 2 with multiplicity k 2 . Since the input is a product state, outputs are 
products of outputs of those two channels. Hence the above conclusion is derived from [5]. 
Case 2: Instead of Assumptions 1 and 2 of the previous section, we set the following conditions 
on the matrix A: 

(58) rank(A) < n 1_e and ||-A||oo < 



Example 2: 
as bellow. 

(55) 

where i 2 = - 
(56) 

so that m = 
to k~ 2 . 
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Note that the first condition is equivalent to taking the dimension of input space to be of oder 
n 1 ^ 6 . The second condition is similar to Assumption 2, preventing the input state from having 
large Schmidt coefficients. 

Proposition 3.6. Under these two assumptions the empirical eigenvalue distribution of the matrix 
Z n converges almost surely, as n — > oo, to the probability measure S k -2. 

Proof. The expansion of graph works in the exactly same way as in the proof of Theorem 3.1. A 
difference is that we get a different (rough) upper-bound for from (31): 

(59) |/08)| Kn^&V-^-P) 
which implies 

(60) ETr[ZP}< n# a n( 1 - e K*^~ 1 V-rin- 2p -\ a ~ 1 ® as n -> oo. 

The power of n in the general term of the sum above is 

(61) V - (\a\ + la" 1 /?! + l/T^I) -t{p- l/rtl) 

(62) = -\a\-\a- 1 p\ + {\-e){p-\r 1 S\) 

which is non-positive. Indeed, it is obvious from (61) when p— | /? 1 <5 1 ^ and from (62) otherwise. 
To achieve equality, we need to have a = f3 = id, at least. In this case, /(id) = 1 and the power of 
n in (27) in fact becomes 0: 

(63) #(a) - 2p- |a _1 /3| = 2p - 2p - = 
Therefore, 

(64) lim ETr[Z£] = k*^ 1 ^ ■ y fc- 2j H Q ~ 1 ' 9 l = k 2 ■ (fc~ 2 ) p , 

and we recover the announced flat limiting distribution. □ 

As a final remark, note that in this case, the Hayden- Winter trick 

n 1_e 

(65) (RHS of (10)) ► as n oo 

n 

does not produce a useful bound for the larges eigenvalue of the output. 



4. Generalized Bell states for U x U 

We investigate in this section tensor products of two identical random quantum channels. Again, 
an input is fixed and a channel is drawn randomly according to the Haar measure. More precisely, 
we are interested in the following k x k random matrix: 

(66) Z n = ^ c ®^ c {\^ n ){^ n \). 

As in the previous section, \tp n ) is a fixed input vector for each n. The diagram of Z n is presented 
in Figure 3. We stud well-behaved inputs and we show that the output Z n has in this case an 
asymptotically flat eigenvalue distribution. The graph expansion method works in the same way 
except for small modifications on 7 and "necklaces" . 

The implications of the following theorem and a discussion on the reasons for which conjugate 
channels and identical channels give different results will be done in the next section. 

Theorem 4.1. Under those two assumptions in Theorem 3.1, the empirical eigenvalue distribution 
of the matrix Z n converges, as n — ^ 00, to 61/^2. Hence, Z n has asymptotically a flat spectrum. 
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u 



u 



A A 



u 



II 



II 



A A 



3 Cr 



u 



Figure 3. Diagram for the output of $ $ with generalized input 



Proof. We introduce a fixed permutations 7 £ 52p which represents the initial wiring of the □ 
decorations. More precisely, for all i, 

(67) i(i r ) = (i + l) T , ^(i B ) = {i + l) B 

The following is the list of calculation elements in the graphical calculus for each (a, f3) £ S2 P x S2 P - 

(1) "□"-loops: 

(2) "o"-loops: n #a ; 

(3) 'V-loops: none; 

(4) "♦"-"necklaces":g(/3), which is to be defined below; 

(5) Weingarten weights for the [/-matrices: Wg(a/3 _1 ) = Wg(a _1 /3). 

We calculate 



(68) 



E[TrZ£]= n #Q ^ a) <?(i3)Wg(a" 1 /3) 

a,/3eS 2p 



The above g(/3) is defined as follows. Set a relation: 

(69) i ~ j /?(i x ) = fc T and ^(i") = fc B {x,y = T or 5) 



and make the smallest partition /3 on {1, • • • ,p} such that any two numbers with the above relation 
belong to the same block. Here, each block corresponds to a "necklace" . Note that each "necklace" 
carries the same number of A and A* because A is connected only to U and A* only to U. Hence, 
g(/3) reads 



(70) 



g{P) = ] [ Tr [A Sb ^ {A*) tb + ■ ■ ■ A^Wi (A 

6eBlocks(/3) 



As before, ||fe|| is the cardinality of the block b and Sb,i,h,i = lor* depending on how they are 
connected. We upper-bound g((3) similarly as in the previous section. Since 



(71) Tr [A 3 ^ 1 (A*)^ 1 • • • A'MHI } < n l 
we have a rough bound for g(/3): 

(72) g{fi) < 



as n — > 00 



as n — > 00 



Here, #(/3) is the number of blocks in the relation induced by /3. 
Hence we have 



(73) 



a,/3eS 2p 



as n — > 00 
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The power of n as a whole in the RHS is 

(74) 2p - \a\ + #0) -p-2p- \a^f3\ = -p- (\a\ + \a~ l P\) + #(/3) 

< -p- 101 + #(0X0 

Note that #(/3) ^ p. It is easy to see that = holds if and only if id — > a — > (3 is a geodesic, and 
|0| = and #(/?) = p. I.e., a = /3 = id. Therefore, 



(75) 



lim ETr[ZP] ~ k#^ ■ 1 • AT 2p = fc 2 - 2p = A: 2 • ( -^Y 
n->oo nJ yk 2 J 



proving the flatness of the limiting distribution. Almost sure convergence is proven similarly as in 
the proof of Theorem 3.1, we leave the details to the reader. □ 

5. What distinguishes U <g> U in the graphical calculus 

As we have seen in the preceding sections, conjugate channels {U <g> U) and identical channels 
(U U) yield completely different behaviors. In this section we will explain the difference more 
intuitively in the graphical calculus framework and derive limit eigenvalue distributions in some 
other cases of theoretical interest: U ®U* and U <X> U T . 

5.1. Conjugate versus identical channels. In order to compare $ ® <& and <I> (g> $ we go back 
to Figure 2 and Figure 3 but A will be replaced by A = \fnA so that "necklaces" are proportional 
to loops in n. Of course, this rescaling produces a multiplicative constant 1/n in the graph, which 
appears as an exponent of — p in (76). 

The U U case is represented in Figure 4. Figure 4a shows the graphical calculus of n x & c (g) 
i\ipn) (i>n\) and we find the leading wires in Figure 4b. Remember that we let n go to the infinity. 
The same idea is applied to U U to get the figure 5. 



u 



u 



A A 



U 



u 



4— A | 



(a) Rescaled input 

Figure 4. Diagram for <3? c ® ® c (|Y> n ) (Vv. 



(b) Leading wires 



(8) U-c&se 



Fix a,(3e S2 P and let L(a, /3) be the number of loops (and necklaces). Then, the removal (a, /3) 
gives the following upper bound for the power of n: 

(76) -p +L(a,p) -2p- la" 1 /?! = -3p - |« _1 /3| +^(a,/3) 

" v ' " v ' 

rescaling loops Wg(a-^) 

In the case of U <8> U, it is easy to see from Figure 4b that L(a, f3) ^ 3p and equality holds if and 
only if a = ft = id. Note that, in this figure, U and U correspond to a solid square and a dotted 
square, respectively, so that same-coloured symbols from different kinds of squares will be joined 
through removals (a, (3). Hence, the necessary condition for (76) to be is a = (3 = id. 

On the other hand, in case of U <8> U we have a different bound: L(a, (3) ^ 4p. This is clear from 
Figure 5b in a similar way. Note that equality holds if and only if a = id and [3 = 8. In this case, 
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(a) Rescaled input (b) Leading wires 



Figure 5. Diagram for $ c 3> c (\ijj n ) (ip n \) : U U-ca.se 

|a _1 /3| = p and (76) = 0. In order for (76) to be 0, L(a,f3) may be any number between 2>p and 
4p including the edges. This gap by p gives room for some other a's and /3's than id to survive as 
n — > co. 

5.2. Two different models: U (8> U* and U (8> U T . We have so far investigated on the pairs 
U ®U and U ® U. So, it is natural to ask what would happen for U ®U* and U <8> U T . Here, we 
again get the flat limiting distribution. First, see Figure 6 where n x <& c ® ($*) c ' (\ip n ) (^nl) an d 
n x $ c <g) ($ T ) C (\tpn) (4>n\) are expressed in the graphical calculus. Here, \ip n ) is as in (19) but we 
set t = 1/k, i.e. the dimension of input space is exactly n. This is for convenience as is clear in the 
Figure 7 below. 




{a)U®U* {b)U®U t 



Figure 6. Diagrams for U ®U* and U <g> C/ T -cases 

We again turn Figure 6 into Figure 7 where only the leading wires are drawn. Again, we have the 
tight condition L(a, f3) ^ 3p for those two cases, which implies that only pair a = (3 = id survives 
as n — > co. Hence, we have the following result. 

Theorem 5.1. Under those two assumptions in Theorem 3.1, the empirical eigenvalue distribution 
of the matrix Z n converges almost surely as n — > oo to 6i/ k 2. Here, Z n is defined in U ® U* and 
U U T models in a similar way as for U ®U and U (8) U models. 

6. Non-pure input states for $ <g> l> 

It is a known fact, following from convexity considerations, that the minimum output entropy of 
a quantum channel is attained on pure states. However, an interesting by-product of considering 
mixed inputs is the fact that a product channel $ £3 <E> and a product of complementary channels 
§ c (g) $ c may have outputs with different spectra. This was obviously not the case for pure 
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(a) u® u* 



(b) U® U 1 



Figure 7. Leading wires for U <g)U* and U <8> U T '-cases 

inputs, since the partial tracing of a pure state on one of the other space does not alter the non- 
zero spectrum of the resulting density matrix. In this section we consider mixed input states 
for products of conjugated channels and we study the asymptotic eigenvalue distribution for the 
output. Our examples do not outperform the Bell state, but some interesting theoretical properties 
are derived. In the final section, we introduce a new class of random input states, correlated to the 
channel. 

6.1. Mixing a pure Bell state. We start by considering a fixed-rank perturbation of the Bell 
state considered in Section 3. As an input for the product of conjugate channels we consider a state 

M„ 2 (C) 3 X mixcd = y® \<Pn/l)(Vn/l\ ® y- 

The input matrix X mixc d has rank I 2 and we recover the results for the usual Bell states as a 
particular case by letting 1 = 1. 



u 



u 



< u* 



< u 



b 



Figure 8. Mixed input state for conjugated channels 



We investigate first the product of the direct channels, while complementary channels are dis- 
cussed in Theorem 6.2. This last theorem is the highlight of this section, the output state having 
in this case three different types of eigenvalues, of different multiplicities. Note that in this section, 
for the sake of simplicity, we state our convergence result not in the almost sure setting as we did 
before, but in the sense of the weak convergence of the empirical spectral distribution of output 
matrices toward some limit deterministic probability measure. 

Theorem 6.1. In the asymptotic regime n — > oo, k,l fixed, the random matrix M^2(C) 3 Z = 
[& c <g) <& c ] (X mixe( i) has limiting spectral distribution 



k 2 Ai + k 2 
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where 

111 

1 ~ kp + k 2 m 2 

2 k 2 m 2 

Proof. Using the graphical Weingarten calculus and counting the loops in the diagrams, we start 
from 

(77) ETv{Z p ) = r 2p (n/l)- p ^ n* a k*^^l*^{n/l)*^ 1& ^g{nk, a' 1 ^. 

a,/3€S-2 P 

Using the leading term in the Weingarten function, the general term in the above expression contains 
nf( a >@\ where 

/(a, P) = -p + #a + #(/3- 1 <5) - 2p - joT 1 /?! = p- (\a\ + \a~ l p\ + 

Using the geodesic inequality \a\ + (a^ 1 /?! + |/3 _1 <5| ^ \5\ = p, we conclude that only terms corre- 
sponding to permutations on the geodesic id — > a — > (3 — > 8 will contribute asymptotically. We 
have thus 

ETr(Z^) = (1 + o(l)) Yl l- 2 \P\k*^-^- 2p -\ a -^\{-l)\ a -^\. 

Now recall Lemma 3.2. Since in this lemma one has to distinguish the cases where A = (i.e. 
a = id) and A ^ 0, we compute separately the following quantities: 

Sl = r 2 \ B \k 2 - 2p -\ B \(-i)\ B \ = k 2 - 2p {\ - k-H- 2 f = k 2 (k- 2 - k-n~ 2 y. 

0=ACBC[p] 

s 2 = Y r 2 \ B \ k - 2p -\ B \{-i)\ B \ = k~ 2p (i - k-H- 2 y = (k- 2 - k~ 3 r 2 y. 

0=ACBC[p] 

s 3 = Y r^k^-^^-i)^ = k~ 2p (i + kr 2 - k- l r 2 y 

0CACBC[p] 

= ( fe - 2 + k~ l i~~ 2 - k~ 3 r 2 y. 

(here we used Lemma 3.3 again). The result in the statement follows then from 
ETr(Z p ) = (l + o(l))[5i + (5 3 - 5 2 )] 

= (i + o(i)) [(at 1 /- 2 + k~ 2 - k~ 3 r 2 y + (k 2 - \){k- 2 - k~H- 2 y] . 

□ 

Theorem 6.2. In the asymptotic regime n — > co, k,l fixed, the rank k 2 l 2 random matrix M n 2(C) 3 
Z = [$ <g) $] (X m i xe d) has limiting spectral distribution 

i fc 2 ; 2 -fc 2 fc 2 -i 

k^p 1 ~*~ fc 2 / 2 ~*~ fc 2 / 2 ' 

w/iere 

1 1 1 



A;/ 2 A; 2 / 2 fc 3 ! 2 

A 2 = J- 

2 A; 2 / 2 

A 3 1 1 



A; 2 / 2 A; 3 / 2 ' 
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Proof. As in the previous theorem, we start from the moment formula obtained via graphical 
calculus 

ETr(Z p ) = r 2p (n/l)- p Y k* a n* {a ^l # ^(n/l) #< -^ l6) Wg(nk, or 1 /?)- 

Note that the only difference between the above formula and equation (77) is that n and k got 
switched in the first two factors of the general term. The exponent of the large parameter n is 
given by 

f(a, f3) = -p + #(a~ 1 7 ) + #(/3" 1 (5) -2p- \a~ l (3\ =p- (^a) + (a" 1 /?! + \P~ ^D- 

The permutations a, (3 which minimize the above quantity are the ones on the geodesic 7—7-0—7' 
P — >■ S. This geodesic has been studied in [3], Section 6.1. We recall here that these permutations 
are indexed by pair of subsets C A C B C \p] and that one has 



2 A = 
\A\ A / 



A similar formula works for /3 and B. Also, 

\a~ 1 p\ = \B\A\ 

One has 

ETr(^) = (1 + o(l))(kl)- 2p k# a -\ B \ A h#P~\ B \(-l)\ B \ A \. 

0CAC_BC[p] 

As in the proof of Theorem 6.1, we expand the general term in the above sum in terms of \A\ and 
\B \ A\ and deal separately with the empty set cases. 

Sl = (kiy 2p Y ^(k- 1 )^ = (kiy 2p (i + k- 

s 2 = {kiy 2p y ( k ~^ Bl = ( ki y 2p f 1 ~ 1) 

s 3 = (ki)- 2p Y k 2 -\ B \ = (kiy 2p k 2 (i-^\ 

9=ACBC[p] ^ ' 

S 4 = Y (kir 2p k 2 = (kl)- 2p k 2 
9=A=B 



s 5 = Y (kiy 2p k 2 i 2 = (kir 2p k 2 i 2 



-A=B 



Then, 



ETr(Z£) = (1 + o(l))[S! -S 2 + S 3 -S 4 + S 5 ] 



i + k-l) +(i 2 -i)k 2 



(klfp [\ kj y ' \ k 

(1 + ^(1)) 



1 _L_ _L_ 



□ 
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6.2. Adapted mixed inputs for $ <g) <&. In the final section of this paper we introduce a new 
class of input states for product channels which we believe have interesting properties. We consider 
inputs which have the particularity of being random and correlated to the channel. Our approach is 
motivated by the fact that the quantum state reaching the minimum value in equation (5) depends 
a priori on the channel This obvious fact is hard to exploit in practice, and, to our knowledge, 
all attempts up to date to find "interesting" input states do not use this dependence. The states 
we introduce are adapted to the channel, in the sense that they are constructed using the same 
randomness as the channel (the unitary matrix U appearing in the Stinespring form of the quantum 
channel). In order to do that, we have to make one compromise: the input states we consider are 
no longer pure, but mixed. 

In Figure 9 we describe the input matrix X. Note that X is defined via the same unitary 
matrix U appearing in the channels $ and <£, hence the adaptedness (or the correlation). The third 
subfigure contains the global picture for the output matrix Z. Note that each unitary matrix U is 
wired to a copy of U*, hence one expects some cancellations to occur and, in the end, to obtain a 
low-entropy output Z. 

One could, in principle, apply the graphical calculus to compute the moments of Z. In practice, 
it turns out that this task is difficult since there are 8 boxes U in each group of Z. The Weingarten 
sum associated to a moment p of Z will be indexed by pair of permutations a, /3 € S^ p , making it 
very difficult to compute. We postpone this computation to further work. 

7. Discussion 

In this paper, we investigated eigenvalue distribution of outputs of tensor-product of quantum 
channels. 

We gathered evidence that product of a quantum channel and its complex congregate with a 
Bell state input gives the least output entropy. However this is not proven generally yet. In order 
to get smaller output entropy - if it is possible - a direction to explore could be to investigate a 
situation where both the channel and the input are random - and correlated in an appropriate way, 
as suggested in the adapted input model of Section 6.2. We leave this for future work. 
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(b) An adapted input X of rank k. 
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(c) The resulting output matrix Z of maximal rank k 2 . 
Figure 9. Adapted inputs for conjugated channels 
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